per 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Imemational Buruu 




B\3 



INTERNATIOJiJAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Oassificatioo ^ : 

G06F3/14, 9/00, G06K9/36 
G06K 9/46, 9/00 



Al 



(11) International Publication Nnmber: WO 94/03853 

(43) International Publication Date: 17 February 1994 (17.02.94) 



(2 1) International Application Number : PCT/US93/06883 

(22) International Filing Date: 22 July 1993 (22.07.93) 



(30) Priority data: 
07/921.831 



29 July 1992 (29.07.92) 



US 



(71) Applicant: COMMUNICATION INTELLIGENCE COR- 

PORATION (US/US]; 275 Shoreline Drive, Redwood 
Shores. CA 94065 (US). 

(72) Inventor: OSTREM. John, S. ; 777 San Antonio Road, 

#125. Palo Alto, CA 94303 (US). 

(74) Agent: ALLEN, Kenneth, R.; Townsend and Townsend 
Khourie and Crew, One Market Plaza, 20th Fl., Steuart 
Tower, San Francisco, CA 94105 (US). 



(81) Designated States: CA, DE, GB, JP, KR. European patent 
(AT, BE, CM, DE. DK, ES, FR, GB. GR, IE, IT. LU. 
MC. NL. PT. SE). 



Published 

Wuh imemational search report. 



(54) Title: A METHOD AND APPARATUS FOR COMPRESSION OF ELECTRONIC INK 



(57) Abstract 

A method and apparatus are provided for compressing data 
received from a digitizer tablet based on the characteristics of each 
individual stroke contour (150, 160). The digitizer tablet samples 
the position of the writing pen, continuously transmitting data to a 
computer in the form of x and y coordinates plus an indicator of 
whether or not the pen is touching the surface of the tablet (100. 
1 10). After preprocessing, a determination is made of which points 
along the stroke contour are essential to the reconstruction of a 
quality facsimile of the original writing from the sampled points 
(140). This determination is based upon local curvature, local ex- 
trema, and the endpoints of the stroke. In an optional second stage 
of compression, a standard mathematical compression technique 
is applied (160). 
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A METHOD AND APPARATU S FOP rOM PRES.STOM ^.r^^^^^jj^^^ 

BACKGROUND OF THE INVENTION 
The present invention relates generally to data 
capture applications, and .ore specifically, to the compression 
of data generated on a digitizer tablet. 

When something is written on a digitizer tablet the 
output xs a parametric representation of the writing 
trajectory; that is, the writing is represented as a series of 
X, y coordinate values as a function of time, a typical 
digitizer tablet samples the position of the writing pen loo 
.imes per second, at fixed intervals of o.Ol seconds. The 
tablet continuously transmits the resulting data to the 
receiving computer. Thus every 0.01 seconds, the tablet 
samples the pen position and transmits an x-coordinate value, a 
127 -^ditional value which indicates 

Whether the pen is up or down. The data generated in this 
manner is commonly referred to as electronic ink. 

There are applications where it is of interest to 
capture and save the electronic ink. For example, it may be 
desirable to save an electronically generated signature for 
later signature recognition. Another application is the 
permanent storage of data which has been generated on a 
digitizer tablet. 

The problem with saving electronic ink is that the 
storage requirements can be quite larae i f „ • • , 

daffl i= ^ quiTie large if the original sampled 

data IS stored directly. Therefore there is a need for a data 
re uction technique for electronic ink that can achieve high 
data compression and yet be efficient to compute. The 
computational efficiency is important because electronic ink 
may be used on computers with relatively modest CPU power 
TILITT ''"'^^ "'^^^ Significantly slowed if 

t oL '^^'"'^"^ ^° ^^^^^^^ t° -ch pen 

Which b : ' continuous segment 

Which begins when the pen touches the digitizer tablet's 
surface and ends when the pen is lifted from the surface.) 
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There are a number of methods for data compression. 
The earliest methods were based on the idea of minimum 
redundancy coding. The basic idea was very simple; symbols 
which occur frequently are coded with fewer bits than those 
5 which occur more rarely. The earliest known method to apply 
this technique was the Shannon-Fano coding. A subsequent 
development was Huffman coding, which is similar to Shannon- 
Fano coding in many ways. Huffman's contribution was in 
devising a very clever and elegant way to construct the coding 

10 and decoding trees. In general, simple Huffman coding does not 
achieve the same degree of compression as more modern 
techniques, but processing and memory requirements are 
relatively modest. 

A further enhancement of Huffman coding is called 

15 adaptive Huffman coding. A basic problem with standard Huffman 
coding is that the decoding program must have access to the 
Huffman coding statistics. Hence, trying to increase 
compression with more comprehensive statistics is eventually 
self-defeating since more and more coding statistics must also 

20 be saved, which works against the compression. Adaptive 
Huffman coding gets around this problem by adjusting the 
Huffman decoding tree on the fly, effectively gaining the 
advantages of higher order modeling with r.o additional storage 
for statistics, 

2 5 Huffman coding requires that an integral number of 

bits be used to code symbols. Arithmetic coding improves 
compression by using a fractional number of bits per code. 
Although arithmetic coding will always provide as good or 
better compression than Huffman coding, its disadvantage is a 

3 0 significantly larger computational burden. 

Dictionary coding 'is another class of data 
compression. In general, these techniques create a dictionary 
of symbol sequences or combinations which are then accessed by 
indexing into the dictionary. These techniques have become the 
35 standard for compression on small computers because they 
combi:.a good compression with modest memory requirements. 
Examples of codes using these techniques which are applicable 
to MS-DOS include PK2IP, ARC, ARJ, and Lharc, 
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There are also compression techniques that apply 
primarily to specialized domains. An example is compression of 
two dimensional images using the discrete cosine transform and 
its many derivatives. 

A method and apparatus for compressing electronic ink 
which achieves high compression while requiring minimal memory 
and CPU power is needed. 



SUMMARY OF THE INVENTION 
According to the invention, for a computer with a 
digitizing tablet, a method and apparatus are provided for 
compressing data based on the characteristics of each 
individual stroke contour. 

The digitizer tablet samples the position of the 
writing pen a pre-determined number of times per second and at 
fixed intervals. This data is continuously transmitted to a 
computer in the form of x and y coordinates plus an indicator 
of whether or not the pen is touching the surface of the 
tablet. 

In the preferred embodiment the incoming data from 
the digitizer tablet is preprocessed. This preprocessing, 
which is application and/or system specific, is not part of the 
compression. Examples of preprocessing include smoothing for 
noise reduction, andpoint filtering, discarding duplicate data 
points, median filtering, Manning filtering, spatial sampling, 
and size normalization. 

In the ,first stage of compression a determination is 
made of which points along the stroke contour are essential to 
the reconstruction of a quality facsimile of the original 
writing from the sampled points. The method proceeds by 
calculating descriptors and finding essential features of the 
stroke contour, in the preferred embodiment, the descriptors 
are local curvature, local extrema, and the endpoints of the 
stroke. Although the actual algorithm is somewhat detailed, in 
general the idea is that the extrema are saved, the points 
about which the local curvature is above a threshold are saved, 
and the endpoints of the stroke are saved. 
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In the second stage of compression any number of 
standard compression techniques can be applied including 
Huffman coding, adaptive Huffman coding, arithmetic coding and 
a variety of dictionary compression techniques, m the 
preferred embodiment differences of the form Ax = - x- and 

- Yi - yi.i are calculated and the stage 2 compressiorL 
applied to the differences. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a flow chart of the general compression 
method according to the invention. 

Fig. 2 is a flow chart of the extrema determination 

Step, 

Fig. 3 is a flow chart of the stroke endpoint 
determination step. 

Fig. 4 illustrates the local curvature calculations 
Fig. 5 is a flow chart of the duplicate data points 
preprocessing step. 

Fig. 6 illustrates the endpoint filtering 
preprocessing step. 

Fig. 7 is a flow chart of the endpoint filtering 
preprocessing step. 

Fig. 8 is a flow chart of the median filtering 
preprocessing step. 

Fig. 9 is a flow chart of the Hanning filtering 
preprocessing step. 

Figs. lOA-c illustrate the reconstruction of an 
electronically written signature without applying this 
invention's compression techniques. 

Figs. liA-C illustrate the reconstruction of an 
electronically written signature after applying this 
invention's compression techniques. 

Figs. 12A-C illustrate the reconstruction of a 
sequence of lower case letterc: =f=4-«>. = -i • 

letters after applying this invention's 

compression technique. 

Figs. 13A-C illustrate the reconstruction of a simple 
Une after applying this invention's compression technique. 
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Figs. 14A-E illustrate the trade off between 
compression ratio and reconstruction fidelity. 

Fig. 15 illustrate the apparatus to accomplish the 
disclosed method. 

DESCRIPTION OF THE SPECIFIC EMBODIMENT (S) 
The flow chart of Fig. 1 illustrates the compression 
method described by this invention. Data is input by writing 
on a digitizer tablet (step lOO) . The tablet represents the 
writing as a series of x and y coordinate values as a function 
of time (step lio) . This stream of x,y position coordinates 
along with a pen up/pen down indicator for each coordinate pair 
is continuously sent to the computer (step 120) . 

In the preferred embodiment, prior to data 
compression, the data is preprocessed (step 130) . The exact 
form of preprocessing will vary according to the application 
and system. Preprocessing is generally used to remove noise. 
The details are unimportant as long as a good representation of 
the written input data is maintained. Another optional step 
which is dependent upon the application and system is the 
removal of duplicate or close points (step 140) . This step is 
generally necessary when the digitizer tablet's user writes 
slowly. Slow writing causes oversampling resulting in duplicate 
or close points which are redundant and do not carry 
significant information. 

The first stage of compression is based on the 
characteristics of each individual stroke contour (step 150) . 
In the second stage of compression any of the standard 
mathematical compression techniques can be applied (step 160) . 
Although this stage of compression is not required by the 
invention, it is employed in the preferred embodiment as a 
potential means of further compressing the data. 

After the completion of the second stage, a 
comparison is made between the first stage and the second stage 
compression (step 170). If the compression was not improved 
after applying stage two, then the stage one compression data 
is saved (step 180). if the data was further compressed by 
stage two, then this compression data is saved (step 190). 
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The computer program modules used to compress the 
electronic data are given in the appendix. Significant program 
modules and their specific functions are as follows: 

Stage one compression: 

calc_sav.c Function to detennine which points in the 

signature need to be saved, based on various 
criteria. 

curvature. c Function to calculate the local curvature at a 

point along the contour. 
Stage two compression: 

c»psig2.c Function to compress an array of stage 1 data 

with characteristics of the signature 
verification template data. 

pack_half bytes. c 

Function to pack the value to be compressed into 
half bytes. 

set_4_bits Function to pack values into the upper or 

lower 4 bits of a byte. 
get_4_bits Function to unpack values from the upper or 

lower 4 bits of a byte. 
uncmpsig2.c Function to uncompress an array of data 

compressed by the function cmpsig2.c. 
unpack_hal f bytes . c 

Function to unpack and reconstruct the value 
from half bytes. 

c=.„. • '^"^iled Illustrations of the stage one 

IZ :T°" ""'"'^ ' ^ ^--^ the first 

step Of compression in which extre.a are determined and saved 
The digitizer data is in the form „f . saved, 
values V V sequence o£ coordinate 

alues, Xi and y^, as a function of tiBe (step 2io) . The 
subscript i varies fro. i.„ to i.nsa»ps-2 where nsanps is the 
total nuBber of data points in a stroke. The subroutine is 
initialized by setting i equal to 1 (steo 220,. 

value Of -'eteiBined by coBparing- the 

value of each data point to the values of the data points 
airectly adjacent to it (step 230). To be a BaxiBa, the value 
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^^of either the x coordinate or the y coordinate must be larger 
then the values of the corresponding coordinate of the data 
points on both sides of the data point under review (step 230) , 
To be a minima, the value of either the x coordinate or the y 
coordinate must be smaller then the values of the corresponding 
coordinate of the data points on both sides of the data point 
under review (step 230) . 

If any of the four relationships defined by step 230 
are met (step 240) , then that data point will be saved (step 
250) , otherwise the data point is not saved. After determining 
whether or not to save a data point, the value of i is 
increased by one (step 260) and a determination is made as to 
whether or not all data points have been reviewed (step 270), 
If all data points have been reviewed then this subroutine ends 
15 (step 280), otherwise the subroutine loops back to the 

comparison step (step 230), continuing until all data points 
have been reviewed. 

Fig. 3 shows a flow chart of the second step of 
compression in which stroke endpoints are determined and saved. 
2 0 The digitizer data is in the form of a sequence of coordinate 
values, x^ and y^, as a function of time (step 310) . The 
subscript i varies from i=0 to i=nsamps-l where nsamps is the 
total number of data points in a stroke. The subroutine is 
initialized by setting i equal to 0 (step 315). 

This subroutine first determines whether or not the x 
and y coordinates of the data point indicate that it is the 
first data point of the stroke (step 320), If it is the first 
data point, then it is saved (step 330). Next the subroutine 
determines whether or not the x and y coordinates of the data 
point indicate that it is the last data point of the stroke 
(step 340). If it is the last data point of the stroke, then 
it is saved (step 350) . After determining whether or not to 
save the data point, a comparison is made to determine if all 
data points have been tested (step 360). If they have been 
then this subroutine ends (step 370), otherwise the value of i 
is increased by one (step 380) and endpoint determination 
continues. 
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^ illustrates the calculations used to determine 
the curvature at each data point between the stroke endpoints. 
The stroke contour is represented by line 410 while the 
individual digitizer data points are represented by points 420 
Point 430 is data point i, the data point of interest in this ' 
Illustration. The coordinates of 430 are and y.. 

In Fig. 4 line 440 is defined as the line' segment 
connecting point 430 with point 425. The coordinates of point 
425 are x^.^ and y._^, where d is equivalent to 3. Line 450 is 
defined as the line segment connecting point 430 to p6int 435 
Where point 435 has coordinates x^,, and y^,,. curvature angle 
460 IS the angle defined by lines 440 and 450. 

During stage one compression, the curvature angle for 
each data point is calculated, if the calculated curvature 
angle is within a certain range of angles, the data point is 
saved. If the angle does not fall within the pre-selected 
range of angles, the data point is discarded. The choice of 
the variable "d" as well as the range of threshold angles is 
determined by the application's requirements, m the preferred 
embodiment, the value of d is 2 ; data points which have a 
curvature angle between 5 degrees and 25 degrees are retained 
for applications requiring high fidelity, low compression 
(e.g., signature verification); and data points which have a 
curvature angle between 25 and 50 degrees are retained for 
applications requiring low fidelity, high compression (e g 
simple data storage) . ' 

Figs. 5-8 are detailed illustrations of optional data 
preprocessing methods. Fig. 5 is a flow chart of the method by 
Which duplicate data points are discarded. The digitizer data 
IS in the form of a sequence of coordinate values, x. and y. 
as a function of time (step 510) . The subscript i varies fJom 
1-0 to i=nsamps-l where nsamps is the total number of data 
points in a stroke. The subroutine is initialized by setting i 
equal to 1, x^^^ equal to x^ and y^^^ equal to y^ (step 520). 

The current data point, represented by x. and y. is 
compared to the previous data point, represented by x,, 'and 
Ysav (Step 530) . if the values for the x and y coordinates are 
identical, then the current point is discarded (step 540) and 
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'the next point is compared to x^,^ and y^^^. t^s process 
continues until either ,u of the data po^^ts have Len 
reviewed (step 5.0, or until and do not e^al x and 
Xsav- respectively. ™,en the two data points ar^ no r^Le" 
5 equivalent. X33, is set equal to x, and y is set e™^!\ 

(step S45) . The value of i is th.^ • ^ ° 

J increased by one (st«n Ben. 

and the new value of i is compared to the total nulr of da" 
Poxnts (Step 560) . The subroutine continues until data 
points have been reviewed at which ti™. ^h 7 
10 (step 570) . subroutine ends 

Often the beginning and ending of a stroke hav^ 

stroke contour 610 is nade up of a series „f 
poxnts e.o. Circle „o. centered at the stroke s .irst da'ta 
^0 poxnt 635. and circle 6„. centered at the strokes last dat 

Physical radius of thU c^rcl" T T e^=«Ment, the 

hu. Of data points con^rLVri^.V Le^s! ' 

~:nra\^:c\^r;teV^ - 

Therefore the thr^hMd t " "'"^ «ni„eter. 

.V .ultipiyin, iTde L 'p ::Lr,::r" - 

^ the tablet, resolution Xotts^ti^ 

Preprocess^q Jethod^. "xTe d^Uirr tt:"''"" 
sequence of coordinate values x ana v " " °' ° 

.step ,10,. .he subscript i ^ ^tntlp-s" ""^ 

3 rLrrin e^is'Inra".""""^ °' ^ ^ 

71.. . initialized by setting i equal to o (sten 

715) and setting j equal to i (step 720). 
The variable d is calcm ai-.=.>i / ^ 

- — ectsd threshort"^-;:-,r; 
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indicating that this point is within circle 630, the point 
corresponding to , y. is .i.^arded (step 750). The vate of 

step 730.""""' '^^'^ '''' calculation for d 

step 730) IS repeated. This process continues until d is not 
^ less then the threshold value, at which ti.e that vaL of 7 
Yj xs saved as the second data point (step 770). This 
completes the subroutine for endpoint filtering for the 
beginning of a stroke. 

The endpoint filtering method used for i-h« i ^ 
" o. a stroKe is identical ^o tjt used for 1 

:::::: Trr ^ - - 

nsa.ps-1 <step 7X5, and , is s« e^al to „sa.ps-. ,st.p „o, 

Median f utering is a preprocessing .ethod used in 
the preferred eBbcdi.ent to reduce the effects of tablet 

d't? ::ear ;r: - -sconti„uitri„ t. 

The dl Lter a s^/t r^rc" a""""""' 

-es. and .s a fun«LTo:\L:~ ro^™' 

.nitialx.ed by setting i equal to 1 (step S20) . 

^- ^" P"P"=^==t"9 method, a new value for each 

cl::::: l^-^—^ ^n the „edia„ value of thVee 

until the value If i • subroutine continues 

XI the value of i .s equal to the total number of samples 
mxnus two (step 860) at which time the subroutine is HZ] I 
(step 870) . ""JToutzne is complete 

30 :;^^^-err:;:n;^ri;-rr^';- - 

dxgitizer data is in the for^n = 

values ^ .nH sequence of coordinate 

lues Xi and y., as a function of time (step 910) The 

:otni:rr:a\a%:i;;^ '° - - 
3^ -..aii.ed b. s::::n:°r:;:: i:tz. ^= 

In this method, a linear process is anm 
noise and smooth the stroKe contouj. . ne^ valTe f or Tach ."^^ 
.nd y coordinate is calculated based on the initial values I 
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-three successive data points (step 9301 Th» „ , 
then increased by one (step ,„) LTIL k °' ' " 

until the value of i i. f subroutine continues 

.inus two Isle: llo] at r\ '"^ °^ -"P^- 

S (Step ..0, . ' ' " subroutine is complete 

st^rrrr °' — -d^rte^i:::- 

^° Fig. iiA is another original signature Pi„ mn 

-Ju^ertUr ^"-^ ~^.ro/;:e ^= 

00 t < " =^"'1^ points 1110. 



20 



point, prior to compression, the si,natur: .oarb":: 
Of storage. The points 1120 which have been TircTJ T 
■ points Which have been determined to be critical i^ .h 
reconstruction fidelity. Tig uc is ^ '° 
Signature based upon th'e oirc'ed" o ts" i.T^t^ . 

z-z — ~ :o~/:r - 

stage two compression technique. Wlying a 

letters. As'"in"ri ''uB T'"'" ' ^'^'""^ °' 

received fro^'h "ig I^, ^tLt: '"^ ^ P-"" 
.etermi ed to be critic Jto . 

::L:ar:e~::tr\?^°" ^ 

stage one compression^onttgrL^rr^/r T 
the original), and the reconstruction h °' 
both stage one anrf '=°nstruction shown xn 12c in which 

301 h.te! as percent" ^^^^^^ -™ 

automatical?; ^I^JTT'' "-^^^^ 

is ihe'LigL ora^:r::::ri" 

little structure Fia 1 J ^^"^ ^^^^^ contains 

to be critical to th! ^ determined 

xT^icai to the reconstruction fidelity Pi« -.-.^ • 
reconstruction based , -^J-aeiiry. Fig. 13c is th 

based upon circled points 1320 in which only 
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stage one compression has been applied. The reconstrucrion 
required only 6 bytes and represented a compression to 2 
percent of the original, m this case stage two compression 
was not applied because it increased the number of bytes to 14 
> Figs. 14A-E illustrates the trade between compression 

ratio and the fidelity of the reconstruction. Fig. is the 

original of a printed address which required a total of 1808 

IT °^ compression which 

employed a compression ratio similar to that used in Figs, nc 
and 12C. After both stage one and stage two compression, this 

::::::rr°" '™ °^ ----- --e 

gradually increasing levels. of compression. The reconstruction 
Shown m Fig. 14C, requiring only 9 percent of the original 
appears quite similar to the original. Figs. 14D-E althon.H 
.uite readable, show definite differences Lorn Z^or^^" 
The reconstructions in Figs. 14D and 14E require 8 perLnt and 
7 percent of the original storage space, respectively. 
Although some applications require high fidelity, such as 
signature verification, many applications could tolerate the 

ower fidelity accompanying the high compression ratios shol 
in Figs. 14D-E. 

Fig. 15 shows an embodiment of the apparatus to 
accomplish the method described in this invention. in the 
preferred embodiment, the computer 1550 is an IBM/Pc/AT 
compatible with an IBM compatible monitor 1540 and Keyboard 
1510. computer 1550 employs the methods disclosed by this 
invention to compress the electronic ink. Handwritten 
information is input either directly via the hardwired 

forrable digitizer tablPi-<= a>-o 

y x^er taoiets are m common use in many 

Depending upon the computing power of tablet 1530 the 
signature can either be i»ediately compressed using this 
invention's compression methods or temporarily stored for 
subseguent downloading to computer 1=50. once the handwritten 
data has been compressed, it can either be stored in computer 
1"0 or in separate memory 1560. Monitor 1540 allows the 
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" original handwritten data to be compared with the compressed 
data. 

The invention has now been explained with reference to 
specific embodiments. Other embodiments will be apparent to 
those of ordinary skill in this art in light of this 
disclosure. It is therefore not intended that this invention 
be limited except as indicated by the appended claims. 
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APPENDIX 

The computer program modules used to compress the 
electronic data are given in this appendix. Significant 
program modules and their specific functions are as follows: 

Stage one compression: 

calc_sav.c Function to determine which points in the 

signature need to be saved, based on various 
criteria. 

curvature. c Function to calculate the local curvature at a 
point along the contour. 

15 Stage two compression: 

cmpsig2.c Function to compress an array of stage 1 data 

with characteristics of the signature 

verification template data. 
pack_half bytes . c 

Function to pack the value to be compressed into 
half bytes. 

set_4_bits Function to pack values into the upper or 

lower 4 bits of a byte, 
get_4_bits Function to unpack values from the upper or 

lower 4 bits of a byte. 
uncmpsig2.c Function to uncompress an array of data 

compressed by the function cmpsig2.c. 
unpack_hal f bytes . c 

Function to unpack and reconstruct the value 
from half bytes. 
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/* CALC SAV.C 



* Function to determine which points in the signature need to 
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The results are 



* be saved, based on various criteria, 

* returned in array save_j)tsc. 

* For example, if the ith point of x[],y[] should be saved 

* then save_ptsC[i] = i, else save_pts[i] = O; 

* cur.comp returns: i (i.e. yes or TRUE) for successful 

* scaling 



•k 
* 



0 (i.e. NO or FALSE) if there is an 
error 



* input arguments: curvature^threshold, nsamps, numsegs, 

* segpnt, x, y 

it 

* output arguments: save_pts 



*/ 

#include <stdio.h> 
#include <cic-h> 

BOOL calc_sav ( curvature_thr eshold , 
y, savejtsC) 
COUNT curvature_threshold; 



COUNT nsamps; 



COUNT numsegs; 



COUNT segpnt[] ; 



COUNT x[] ; 



COUNT y [ ] ; 



TINY save_ptscr]; 



nsamps, numsegs, segpnt, x, 
/* threshold for curvature*/ 



COUNT i; 
COUNT k; 



/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 



compression ★/ 

number of samples */ 

(length) in input */ 

arrays x and y */ 

number of individual */ 

segments (strokes) */ 

pointers to beg of */ 

individual strokes */ 

input array of x */ 
values 

input array of y */ 

values * / 

contains information */ 

about what points to ★/ 

keep (returned to */ 

calling program) */ 

loop counter */ 

loop counter */ 
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COUNT value 1; 
COUNT value2; 
COUNT checks ; 
COUNT lower^limit; 

COUNT upper^limit; 

BOOL curvature ( ) ; 

COUNT size^angle; 

BOOL clockwiseB; 

BOOL calc_dir; 

COUNT dir^angle; 
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/* temporary variable 
/* temporary variable 
/* temporary variable 
/* defines starting pt 
/* for curvat compress 
/* defines ending pt for 
/* curvat compress 
/* function to calculate 
/* local curvature 
/* calculated curvature 
/* at a point 
/* YES if clockwise, NO 
/* otherwise 
/* if YES calculate 
/* dir_angleP 
/* direction of curvature */ 
/* bisector 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
V 
*/ 
*/ 



/ 



* Initialize the 
* 



savG_ptsC array. 



for (i = 0; i < 



{ 



nsamps; -f+i) 



save_ptsC[i] = i; 



* Perfoo, curvature based compression (determines which points 

* to keep of the stroke and saves the information in array 
- save_ptsc to return to the calling program) . 



*/ 



for (i = 0; i < numsegs; ++i) 
{ 

lower_limit = segpnt[i]; 
upper_limit = segpnt[i + 1]; 
while (lower_liinit < upper_lirait) 
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checkB = NO; 

for (k = lower^limit + 2; k < upper_liinit ; k += 2) 
{ 

curvature (X, y, (k + lower_lirait) / 2, 
5 nsamps, (k - lower_limit) / 2, 

&si2e_angle , &clockwiseB , &calc_dir , 
&dir_angle) ; 
if (size^angle > curvature_threshold) 
{ 

3^0 CheckB = YES; 

break; 
} 

) 

if (CheckB) 
^5 ( 

value2 = (k + lower_liinit) / 2; 

valuel = value2; 

) 

else 

{ 

valuel = segpnt[i +1] - i; 
value2 = (valuel + lower^limit) / 2; 



20 



25 



30 



35 



for (k = lower_limit + l; k < valuel; -n-hk) 
I 

if (k ]= value2) 

save_ptsC[k] = 0; 

) 

if (CheckB) 

lower_liinit = value2 ; 

else 

lower^limit = upper_liinit ; 

) 



/* 



to calling program with successful completion flag. 
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*— • 

V 

return (YES); 

/* 

* 

*/ 

) 



/* end of function */ 
/* CALC_SAV */ 

/* CURVATURE. C 



* Calculates the curvature, direction of curvature (angle) 

* bisector, and whether the change in angle is clockwise or 

* counter_clockwise at a point, defined by function argument 

* "index" into the arrays of coordinates points x[], y[] 

* representing the stroke contour. The quantities returned 

* are in tencs of the angle range 0-360 degrees. 

* If ERROR is detected the BOOL function value is YES (=1) , 

* while for a normal return the BOOL function value is NO ' 

* (=0) . 



* 



* CAUTION: when rhe curvature is 0 (i.e., calculated 

* size^angleP = o or close to 0), the calculation of the 

* direction of the curvature bisector is, of course, 

* inherently ambiguous and unreliable. 



* — 

*/ 



#include <stdio.h> 
#include <cic.h> 



BOOL curvature(x, y, index, maxpts, Ighseg, size_angleP, 
clockwiseBP, calc_dirB, dir_angleP) 

COUNT xf ] ; /* ^ ^ . 

/* X coordinate values of */ 

/* character */ 

COUNT yri ; /* ^. 

/* y coordinate values of */ 

/* character ★/ 

COUNT index; /. p,i„, in x, y to calc. */ 

/* curvature */ 
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/* 



COUNT maxpts; 



COUNT Ighseg; 



COUNT *si2e_angleP; 



BOOL *clockwiseBP; 



BOOL calc dirB; 



COUNT *dir_angleP; 

{ 

COUNT plus_point; 
COUNT ininus_point; 
COUNT plus_angle; 

COUNT minus^angle; 

COUNT dif_angle; 

COUNT cic_atan2() ; 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 



/* maximum length of 
/* arrays x,y 
/* length of segment for 
/* curvatur calc 
/* calculated curvature 
/* at point index 
/* YES if clockwise, NO 
/* otherwise 
/* if YES calculate 
/* dir_angleP, otherwise 
/* otherwise return 
/* dir_angle = o 
/* direction of curvature */ 
/* bisector */ 

/* stores "index + Ighseg"*/ 
/* stores "index - Ighseg"*/ 
/* stores calc direction */ 
/* of upper seg. */ 
/* stores calc direction */ 
/* of lower seg. */ 
/* stores angle angle */ 
/* difference */ 
/* function to calculate */ 
/* angle in the */ 
/* range -1800 to 1800 */ 
/* (degrees x 10) ★/ 



* Check for errors in specifying the range for curvature 

* computation. 



V 



minus_point = index - Ighseg; 
plus_point = index + ighseg; 
if (minus_point < o [ | plus_point >= maxpts) 



{ 



*si2e_angleP = O; 
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/* 
*- 



*clockwiseBP = YES ; 
*dir_angleP = 0; 
return (YES) ; 



) 



/* ERROR - out of bounds, */ 
/* cannot calc. curvature */ 



* Calculation of the local curvature (i.e., angle ciiange) . 

* Note: the function cic_atan2 returns angle values in the 

* range -1800 to 1800, so it is necessary to divide by lo to 

* get the desired curvatures. 



*/ 



plus_angle = cic_atan2 (x[plus_point 1 - xfindexl, 

y[plus_point] - y[index]); 
minus_angle = cic_atan2 (x[ index] - x(ininus_point ] , 

y[index] - y (minus_point] ) ; 
dif_angle = plus_angle - minus_angle; 
if (dif_angle > 1800) 

dif_angle -= 3600; 
else if (dif_angle < -1800) 

dif_angle += 3 600; 
if (dif_angle < 0) 

{ 

*size_angleP = (-dif_angle + 5) / lO; 
*clockwiseBP = YES; /* by convention 

/* clockwise 
' /* angle changes are 

/* negative 



else 



{ 

*size_angleP = (dif_angle + 5) / lO; 
*clockwiseBP = NO; /* by convention 

/* counter-clockwise 
' /* angle changes are 

/* positive 



*/ 
*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
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* Calculate the direction of the bisector of the local 

* curvature (i.-e., angle change) computed above in the range 

* 0-3 60 degrees. Note: the cic_atan2 returns angle values in 

* the range -1800 to 1800, so an adjustment is made in the 

* code below so that the resulting calculated bisector value 

* is in the range 0-360. 

* 

*/ 

if (calc_dirB) 
{ 

if (*cloc)cwiseBP == YES) 

*dir_angleP = plus_angle - (900 + (dif_angle - i) 
/ 2) ; 

else 

*dir_angleP = plus_angle + (900 - (dif_angle + i) 
/ 2) ; 

if (*dir_angleP < o) 

*dir_angleP += 3 600; 
*dir_angleP = (*dir_angleP + 5) / lo; 
if (*dir_angleP >= 360) 

*dir_angleP -= 360; 

) 

return(NO) ; ^^^mal return - no */ 

/* errors detected */ 



*/ 

) 

/* 

CMPSIG2 . C 

* 

* 



/* end of function */ 
/* CURVATURE. C */ 



Function to compress an array of data with charcteristics of 

* the signature verification template data. The companion 

* program to uncompress the data is UNCMPSIG2.C. NOTE: 

* compression/uncompression is exact (not lossy) and 

* restores the original data exactly. 
* 

* cmpsig2 returns: l (i.e. YES or TRUE) for successful 
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* compression 

* 0 (i.e. NO or FALSE) if there is an 

error 

5 * input arguments: 

* numin - number of elements (length) of data array to 

* be compressed 

* array - data array to be compressed 

* maxout - maximum allowed size of the compressed array 
^0 * comp_arrayUC[ 3 (to prevent overwriting 

* memory) 
* 

* output arguments: 

* num_comp_bytesP - ptr to # elements (each a byte) of 
^5 * the compressed data stored in 

* comp_arrayUC[ ] . 

* comp_arrayUC - unsigned byte array containing the 



* 

20 *- 



compressed representation of the input 
data in array [ ] ; 



#include <stdio.h> 
#include <cic.h> 

25 BOOL cmpsig2 (numin, array, maxout, num_comp_bytesP, 

comp_arrayUC) 

COUNT numin; /* number of points in input */ 

/* array */ 

COUNT array[]; /* array containing input */ 

/* data to be compressed */ 

COUNT maxout; /* max number of pts for */ 

/* comp_arrayUC */ 

COUNT *num_comp_bytesP; /* actual #bytes used in */ 

/* comp_arrayUC */ 

35 TBITS comp_arrayUC[] ; /* output array of compressed */ 

/* data ★/ 

{ 

COUNT i; /* loop index */ 



10 



15 



20 



25 



30 



35 
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COUNT nuin_sign_change; /* counts number of sign */ 

/* changes */ 

COUNT value; /* for general use temporary */ 

/* storage */ 

COUNT byte_part; /* i if upper 4 bits of byte, */ 

/* and 2 if lower 4 bits of */ 

/* byte */ 
TBITS * compear r ay UCP; /* ptr to comp_arrayUC of sig.*/ 

/* data */ 

TBiCTS byte^maskUC; /* byte used to temporarily */ 

/* hold compressed data */ 
COUNT last^index; /* saves last index of direct.*/ 

/* change */ 

COUNT current_sign; /* used in sign change */ 

/* determination */ 

COUNT last_sign; /* used in sign change */ 

/* determination */ 

BOOL set_4_bits ( ) ; /* function to pack values */ 

/* into the upper or lower */ 

/* 4 bits of a byte */ 

BOOL get_4_bits{) ; /* function to unpack values */ 

/* from the upper or lower */ 

/* 4 bits of a byte */ 

BOOL pack_halfbytes() ; /* implements main packing */ 

/* logic */ 

/* 

★ 

* Initializations for the compression process. 
* 

V 

*num_comp_bytesP = o ; 
*comp_arrayUCP = comp_arrayUC ; 

/* 

* — - 

* First, save the number of items in the array to be 

* compressed. This value is stored as two bytes. The first 

* byte is the upper 8 bits of the 16 bit COUNT value for 

* numin, and the second byte is the lower 8 bits. 
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* — — — — _ — — _ — _^ 

*/ 

*nuia_corop_bytesP += 1; 

*coinp_arrayUCP++ = (TBITS) ( (nuitiin » 8) & Oxf f ) ; 
5 *nuin_coinp_bytesP += l; 

*coinp_arrayUCP++ = (TBITS) (numin & Oxf f ) ; 

/* 

* ^ . 

* Find the number of sign changes (i.e., when successive 
10 * difference values have different signs) and save to 

* coinp_arrayUC. Note that the maximum number of sign changes 

* allowed is 255. The function immediately exits and returns 

* an error flag if > 255 sign changes are detected. 
* — ^ ^ 

15 */ 

num_sign_change = 0; 

last_sign = arraytl] - array[0]; 

for (i = 2; i < numin; ++i) 

\ 

20 current_sign = array [i] - array [i - 1]; 

if ( (current_sign < o && last_sign >= 0) |i 
(current_sign >= o last_sign < 0) ) 

{ 

last_sign = current_sign; 
2 5 ++num_s ign_change ; 

) 

) 

if (num_sign_change < 256) 
{ 

30 *num_comp_bytesP +=1; 

tcomp_arrayUCP++ = (TBITS) num_sign_change ; 



else 

{ 

fprintf (stderr, "\n\nCMPSIG ~ num_sign__change > 

256\n\n") ; 
return (NO) ; 

) 
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/* 

* '. 

* Initializations for compression 

* . „-„ 

5 */ 

byte_part = 1; 
byte_inaskUC = 0; 

/* 

* ^ 

^0 * Pack the sign of the first value of the input data (i.e., of 

* array [0]) 

★ 

*/ 

if (array[0] >= 0) 
15 value = i; 

else 

value = 0; 

pack_half bytes (value, &byte_part, &byte_maskUC, 
&comp_arrayUCF, nuin_coinp_bytesP) ; 

20 /* 

★ 

* Pack the sign of the first difference value (i.e., of 

* array [1] - array[0]) 

* 

25 */ 

last_sign = array [1] - array [ 0]; 
if (last_sign < 0) 
value =0; 

else 

30 value = l; 

pack_half bytes (value, &byte_part, &byte_inaskUC, 
&coinp_arrayUCP, nuni_coinp_bytesP) ; 

/* 

*- 

35 * Pack the sign change positions (where successive difference 

* values have different sign) . 



*/ 
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last_index = O; 
for (i = 2; i < numin; ++i) 
{ 

current_sign = array [i] - array [i - 1]; 
if { (current_sign < 0 && last_sign >= O) || 
(current_sign >= o && last^sign < o) ) 

{ 

last_sign = current^sign; 
if ((i - last_index) > 255) 
{ 

fprintf (stderr, "\n\nCMPSIG ~ sign index 

change > 255\n\n") ; 
return (NO) ; 

pack_half bytes (i - last_index, &byte_part, 
&byte_maskUC, &coinp_arrayUCP, 
nuin_coinp_bytesP) ; 

last_index = i; 

) 

^ /* end of loop to pack sign */ 

/* change position values */ 



/ 



Pack differences values into bytes and save to comp^arrayUC 



* 

V" 

for (i = 0; i < 



numm; 



if (i == 0) 

value = abs ( array [ 0] ) ; 

else 

value = abs(array[i] - arrayfi - 1]); 
pack_halfbytes (value, &byte_part, &byte_maskUC, 

&coinp_arrayUCP, nuin_comp_bytesP) ; 
if (*nuin_coinp_bytesP >= maxout) 

{ 

fprintf(stderr,"\n\nCMPSIG - coinp_arrayUC too 
large\n\n") ; 
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return (NO) ; 



15 



/* end of loop to pack */ 
/* difference values */ 



/* 



* Finally, if the data ends on an odd number of half bytes, 

* roust save one final byte because the last signature data 

* value is in the first half byte of byte^maskUC and hasn't 
10 * yet been saved to comp^arrayUC. 

* — 

V 

if (byte_part == 2) 
{ 

set_4_bits( (UTINY) 15, 2, &byte_maskUC) ; 
*nuin_coinp_bytesP += 1; 
if (*num_coinp_bytesP > maxout) 
{ 

fprintf (stderr, "\n\nCMPSIG ~ comp^arrayUC too 



20 large\n\n") ; 

return (NO) ; 



25 /* 

* 
* 

30 *. 



*coinp_arrayUCP++ = byte__maskUC; 
) 



If this point is reached the compression process has 
proceeded normally and no errors have been detected, so 
return with a "successful completion" status flag. 



return (YES) ; 



* — 
*/ 



3 5 \ 

' /* end of function CMPSIG2 */ 

/* PACK_HALFBYTES . C 
★ - — — 



* Function to pack a COUNT value into half byt. 
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* 
* 
* 

*/ 



pack_half bytes returns: l (i.e. YES or TRUE) for 

successful packing 
0 (i.e. NO or FALSE) if 
there is an error 



BOOL pack_half bytes (value, byte_partP, byte_maskUCP, 
comp_arrayuCPP, num_coinp_bytesP) 
COUNT value; /* value to be compressed 



*/ 
*/ 



COUNT *byte_partP; /* i if ^pper 4 bits of byte, 

/* and 2 if lower 4 bits of 
/* byte 

TBITS *byte_inaskUCP; /* packed'byte representing 

/* different values ..^ 

TBITS **coinD arrayUCPP; /* m-T- -ho 

yuv-t-f, / per to ptr to coinp_arrayUC */ 

/* element 

COUNT *num_comp_bytesP; /* actual #bytes used in */ 

/* comp_arrayUC 



*/ 
*/ 
*/ 
*/ 



UTINY tempuc; /* temporary value */ 

UTINY continuation_labelUC;/* used to save continuation*/ 



/* label 

BOOL get_4_bits(); /* function to unpack values */ 

/* from the upper or lower 4 */ 

/* bits of a byte */ 

BOOL set_4_bits(); /* function to pack values */ 

/* into the upper or lower 4 */ 

/* bits of a byte */ 

if (value < 14) 

{ 



/* 



* Pack 4 bit value into byte. When the absolute difference 

* value IS < 14, then it can be packed into a half byte (4 

* bits) . - i K 



*/ 
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set_4_bits((UTlNY) value, *bvte nan-P k ^ 

if ^ oyte_pan:P, byte maskUCP) ; 

If (*byte_partP == 2) _ '^y , 

{ 

*byte_partP = j.; 

*nuin_coinp_bytesP += i* 
**coinp_arrayuCPP = *byte_maskUCP; 
++(*coinp_arrayUCPP) ; 
) 

else 

*byte_partP = 2; 

) 

else 

{ 



* Pack 4 bit continuation label into byte wh^n ^-l Z 

n^. ,,=,1,,. . ^^^"^ the absolute 



difference value i 
* previous block above deals with values" 



^ <= 30 (and, Of course, >= 14 since the 



_ x-^axa witn values < 14^ ^-k^^ 



half byte, and write the remainder 

va-Lue xs >= 30, pack 15 into the first half hv^'^^'T^ 
the remainder into another byte if the ab . ""^'^ 

* value is > 14 + 255 - ll absolute differenc 

* the function .:: Lt"eiT;x t:"a:: IT 

* condition. "'^ ^^"^"^"^ 



*/ 



if (value > 269) 
{ 

^ fprlntf (stderr, "VnVnCMPSIG - dif value out of 

fprintffstaerr. "range (= %d,x„x„.., .^lue) ; 

return (NO) ; /* ^ 

^ /* return on ERROR detection */ 

else if (value < 30) 

continuation_labelUC = (UTINY) 14; 

else 

continuation_labelUC = (UTINY) 15; 
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* First, set half byte for continuation label. 

★ — 

5 */ 

set_4_bits ( cont inuation_labelUC , *byte j»artP , 

byte_inaskUCP) ; 
if (*byte_partP == 2) 

{ 

10 *byte_partP = i; 

*nuin_coinp_bytesP 1; 
**coinp_arrayUCPP = *byte_inaskUCP; 
-i-+(*coinp_arrayUCPP) ; 

) 

15 else 

*byte_partP = 2; 
if (value < 30) 
{ 



/* 

20 



* Pack a value corresponding to (value - 14) into a second 

* half byte (i.e., a value from 14 to 29 will be converted to 

* a value from o to 15 for packing) . 
* 

25 */ 

set_4_bits( (UTINY) (value - 14), *byte_partP, 

byte__inaskUCP) ; 
if (*byte_partP == 2) 
{ 

^0 *byte_partP = i; 

*nuin_coinp_bytesP += i; 
**coinp_arrayUCPP = *byte_inaskUCP ; 
++(*coinp_arrayUCPP) ; 
) 

35 else 

*byte_partP = 2 ; 

) 

else 



wo 94/03853 PCT/US93/06883 

31 



/* 



* If the original absolute difference value is >= 30, then 

* must pack two more half bytes in additional to the' 

* continuation half byte. 



*/ 

10 



if (*byte_partP == i) 
{ 

*byte_inaskLTCP = (tbiTS) (value - 14); 

*nuin_coinp_bytesP += i; 
**comp_arrayUCPP = *byte_inaskUCP; 
++(*comp_arrayUCPP) ; 
) 

else 
{ 

get_4_bits(&tempUC, l, (TBITS) (value -14)) ; 
set_4_bits(teinpUC, 2, byte_iiiaskUCP) ; 
*num_corap__bytesP += i; 
**coinp_arrayUCPP = *byte_inaskUCP; 
++(*coinp_arrayUCPP) ; 

get_4_bits(&teinpUC, 2. (TBITS) (value - 14)); 
set_4_bits((UTINY) tempUC, l, byte.maskUCP) ; ' 

) 

) 

return (YES); ^* value unpacked successfully*/ 

/* end of pack halfbytes */ 
/* UNCMPSIG2.C ~ ^ 



* Function to uncompress an array of data compressed by 

* function CMPSIG2. NOTE: compression/uncompression is exact 
(not lossy) and restores the original data exactly. 



uncmpsig2 returns: i (i.e. YES or TRUE) for 

successful uncompress 
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0 (i.e. NO or FALSE) if 
there is an error 



* input arguments: 

5 * comp^arrayUC - unsigned byte array containing the 

* compressed representation of the data 

* (as created by CMPSIG2.C) 

* output arguments: 

10 * numoutP - ptr to number of elements (length) of the 

* uncompressed data 



* 

15 * 



* array - data array created by unpacking and 

reconstructing values taken from the 
compressed array comp_arrayUC[ ] , 



*/ 

# include <stdio.h> 
^include <cic.h> 



20 BOOL uncmpsig2 (numoutP, ai 
COUNT *numoutP; 

COUNT array [] ; 

5 

TBITS comp_arrayUCt ] ; 
{ 

0 COUNT i; 

COUNT kount_bytes; 

TBITS *comp_arrayUCP; 

5 COUNT num_sign_change; 

COUNT value; 



, comp_arrayUC) 



/* 


number of points in output 


*/ 


/* 


array 


*/ 




array for output signat 


*/ 


/* 


data that has reconst from 


*/ 


/* 


comp_arrayUC by reversing 


*/ 


/* 


the compression proc. 


*/ 


/* 


compressed input array of 


*/ 


/* 


sig data 


*/ 


/* 


loop index 


*/ 


/* 


kounts bytes used in 


*/ 


/* 


comp_array 


*/ 


/* 


ptr to comp_arrayUC of sig 


*/ 


/* 


data 


*/ 


/* 


counts number of sign 


*/ 


/* 


changes 


*/ 


/* 


for general use temporary 


*/ 


/* 


storage 


*/ 



10 



15 



20 



25 



30 



35 
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COUNT byte_part; /* 1 if upper 4 bits of byte, */ 

/* and 2 if lower 4 bits of */ 

/* byte */ 

TBITS byte_inaskUC; /* packed byte representing */ 

/* different values */ 

COUNT sign_change[256]; /* array of sign change */ 

/* indices */ 

COUNT *sign_changeP; /* pointer to sign change */ 

/* index array */ 

COUNT sign_first_dif ; /* saves sign of first */ 

/* difference */ 

COUNT current_sign; /* current value of sign */ 

BOOL unpack_halfbytes() ; /* implements main unpacking */ 

/* logic */ 

/* 

* — — « 

* Initializations for compressed array 
★ 

*/ 

kount_bytes = O; 
coinp_arrayUCP = coinp_arrayUC ; 

/* 

★ 

* First, get the number of items in the array that was 

* compressed. This value is stored as two bytes. The first 

* byte is the upper 8 bits of the 16 bit COUNT value for 

* numout, and the second byte is the lower 8 bits. 
* 

*numoutP = 0; 

*numoutP = *comp_arrayUCP++ ; 

*numoutP <<= 8; 

*numoutP = *numoutP | (*comp_arrayUCP++) ; 

kount_bytes += 2 ; 
/* 

* 



* Get the number of sign changes 
★ 
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*/ 

nuin_sign_change = (COUNT) (*conip_arrayUCP++) ; 
++kount_bytes ; 

/* 

5 * 

* Check for error condition. If the value read in for'thl' 

* number of sign changes is >= 256, then an error has 



* occurred. 
* — 

10 */ • 



if {nuin_sign_change < o j j nuin_sign_change >= 256) 

fprintf(stderr, "\n\nUNCMPSIG ~ number of sign 
changes out ") ; 

15 fprintf(stderr, "of range (= %d) is >= 256\n\n", 

num_sign_change) ; 
return (NO); return on ERROR detection */ 



Initializations for unpacking. 



/* 

20 * 
* 

* 

*/ 

byte_part = i; 

25 /* 

* 

*, unpack the sign of the first value of the original" 
* uncompressed data. 

* 

30 */ 

unpack_halfbytes (lvalue, &byte_part, &byte_maskUC, 

&comp_arrayUCP, &kount_bytes) ; 
if (value == o) 

current_sign = -i; 

35 else if (value == i) 

current_sign = i; 

i -. se 
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/* 






* Error condition since 


value i= 0 (for minus sign) and also 


• J. 


^ i-ui pxus 5j.gn^ • 










*/ 








fprintf (stderr, 


"\n\nUNCMPSIG — initial sign value ") ; 




fprintf (stderr , 


"is inconsistent (= %d)\n\n", value); 




return (NO) ; 


/* return on ERROR detection */ 


10 


) 


/* 







* Unpack the sign of the first difference value of the 

* original, uncompressed data. 

15 * 



*/ 

unpack_half bytes (Scvalue, &byte_part, £cbyte_maskUC, 
&comp_arrayUCP , 
&kount_bytes) ; 
20 if (value == 0) 

sign_f irst_dif = -i; 
else if (value ~ i) 

sign_f irst_dif = 1; 

else 



25 




{ 






/* 


















Error condition since 


value != 0 (for minus sign) and also 




* 


!= 1 (for plus sign) . 




30 










V 










fprintf (stderr, 


"\n\nUNCMPSIG ~ initial sign value ") ; 






fprintf (stderr, 


"is inconsistent (= %d)\n\n", value); 






return (NO) ; 


/* return on ERROR detection */ 


35 




) 




/* 







* Unpack sign change positions (where successive difference 
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* values have different sign) . 



for (i = 0; i < nuin_sign_change; ++i) 
{ 

unpack_hal f bytes ( &value , &byte j>art , &by te^maskUC , 
&comp_arrayUCP, &kount_bytes) ; 

if (i == 0) 

sign_change[0] value; 
else s 

sign_change[i] = sign_change[ i - 1] + value; 
) /* end of loop to unpack sign */ 

/*change position values */ 



* 

*/ 



Unpack differences from bytes and read from disk. 



sign_changeP = sign_change;/* pointer to sign change */ 

/♦index array */ 
for (i = 0; i < *numoutP; ++i) 

{ 

unpack_half bytes (& value, &byte_part, &byte_inaskUC, 

&coinp_arrayUCP , &kount_bytes) ; 
if (i > 1 && nuin_sign_change > 0 && i == *sign_changeP) 
( 

current_sign = -current_sign; 
— nuin_sign_change ; 
++sign_changeP ; 
) 

if (i == 0) 

arrayfi] = current_sign * value; 
else if (i == i) 

{ 

sign_chav -p = sign_change; 

array [i;, = array [i - l] + sign_f irst_dif * value; 
current_-.gn = sign_first_dif ; 

) 



wo 94/03853 PCr/US93/06883 

37 

else 

array [i] = array [i - 1] + current_sign * value; 
) /* end of loop to unpack and */ 

/* reconstruct data */ 

5 /* 



* Check for inconsistency in specified size (in bytes) of the 

* input compressed array (comp^array) versus the actual number 

* of bytes used in the uncompress process to create the output 
10 * uncompressed array. 

* ^ 

if (kount_bytes != num_comp_bytes) 

{ 

fprintf (stderr, "\n\nUNCMPSIG ~ kount_bytes (= %d) 
15 != kount_bytes) ; 

fprintf (stderr, "num_comp_bytes (= %d)\n\n", 
num_comp_bytes) ; 

return (NO) ; 
) 

20 */ 
/* 

* — — — 

* If this point is reached the uncompression process has 

* proceeded normally and no errors have been detected, so 

25 * return with a ''successful completion" status flag. 

* 

*/ 

return(YES); 

/* 

) /* end of function UNCMPSIG2 */ 
UNPACK_HALFBYTES . C 
* 

* Function to unpack and reconstruct a COUNT value from half 

* bytes. 
35 ★ 



★ 



* 



pack_half bytes returns: 1 (i.e. YES or TRUE) for 

successful unpacking 
0 (i.e. NO or FALSE) if 
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there is an error 
* — — . 

*/ 

BOOL unpack_halfbytes(valueP, byte_partP, byte_maskUCP , 



5 




comp_arrayUCPP, kount 


bvt©sP^ 






COUNT 


*valueP; 


/ 


contains uncompressed 


*/ 








/* 


vaxuc uo De returneci to 


*/ 








/* 


main program 


*/ 




COUNT 


*bv"te DartP: 


/ 


1 if upper 4 bits of byte, 


V 


10 






/ 


ana 2 ir lower 4 bits of 


*/ 








/ 


byte 


*/ 




TEXTS 


*byte maskUCP; 




pacKea oyre representing 


*/ 








/* 


different values 


*/ 




TRTTc: 
X ox X o 






ptr to comp_arrayUC of sig. 


*/ 


15 








aa^a 


*/ 




COUNT 


*kount: bvtesP* 




kounts bytes used in 


*/ 




{ 




/* 


coinp_array 


*/ 




UTINY 


tempUC ; 


/* 


temporary value 


*/ 


20 


UTINY 


valueUC; 


/* 


temporary value 


*/ 




BOOL 


get_4_bits() ; 


/* 


function to unpack values 


*/ 








/* 


from the upper or lower 4 


*/ 








/* 


bits of a byte 


*/ 




BOOL 


set_4_bits() ; 


/* 


function to pack values 


*/ 


25 






/* 


into the upper or lower 4 


*/ 








/* 


bits of a byte 


*/ 



/* 



* 

* Unpack 4 bits. When the value is < 14, it corresponds to a 

* legitimate difference value, and hence the next array value 

* can be calculated by adding the difference (with the correct 

* sign) to the preceding array value. If the value == 14, the 

* absolute difference value is >= 14 and <= 29, hence another 

* half byte must be read in to compose the absolute 

* difference. if the value == 15, the absolute difference 

* value is >= 30 and < 269, hence another full byte (i.e., 2 

* half bytes) must be read in to compose the absolute 

* difference. 
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10 



*/ 

if (*byte_partP == i) 

{ 

++(*)count_bytesP) ; 
*byte_inaskUCP = **coinp_arrayUCPP; 
++(*coinp_arrayUCPP) ; 
) 

get_4_bits(&valueuc, *byte_partP, *byte_inaskUCP) ; 
*byte_partP = 3 - *byte_partP; 
if (valueUC < 14) 

*valueP = (COUNT) valueUC; 
else if (valueUC == 14) 
{ 

15 if (*byte_partP == i) 

{ 

*byte_maskUCP = **comp_arrayUCPP; 
++(*coinp_arrayUCPP) ; 
++(*kount_bytesP) ; 
) 

get_4_bits(&valueuc, *byte_partP, *byte_inaskUCP) ; 
*valueP = (COUNT) (valueUC + 14); 
*byte_partP = 3 - *byte_partP; 
) 

25 else if (valueUC == 15) 

{ 

if (*byte_partP == i) 

{ 

*byte_maskUCP = **coinp_arrayUCPP; 
++(*coinp_arrayUCPP) ; 
++(*kount_bytesP) ; 

) 



20 



get_4_bits(&t6inpUC, *byte_partP, *byte_maskUCP) ; 
set_4_bits(terapUC, i, &valueUC) ; 
*byte_partP = 3 - *byte_partP; 
if (*byte_partP == i) 
{ 
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*byte_maskUCP = **coinp_arrayUCPP; 
++(*coinp_arrayUCPP) ; 
++(*kount_bytesP) ; 

) 

get_4_bits(.te.puc, *byte_partP, *byte .asKUCP, • 
set_4_bits(teinpuc, 2. SvalueUC) ; 
*valueP = (COUNT) (valueUC + 14) • 
*byte_partP = 3 - *byte_partp; 

10 else 

{ 

*byte_inaskUCP = **comp_arrayuCPP; 
++(*coinp_arrayUCPP) ; (*kount_bytesP) ; 

return (YES) ; /* ,,=,1, 

/ ^^^""^ unpacked successfully */ 



15 



/*■ 

V 

^ /* end Of function UNPACK_HALFBYtes */ 

20 /* GET_4_BITS.C 



* 



Function that returns the UTINY v?,ino 

either the upper 4 bits in t^ • —-Pending to 

^ . . oiTzs in the input bvte nr- i 

. tL ;„p" : ""^ ^ ^" range o to 3 

me input arguement "DositT nr." xi^^ 'tJ ^ ir*, 

* or lower 4 bit. i. taKe" - "hether the upper 



position =1 

upper four bits (or upper 

30 * "nybble") 
position = 2 — 1 

lower four bits (or upper 

^ "nybble") 

V 

#include <stdio.h> 
35 =include <cic.h> 

BOOL get_4_bits (valueUCP nnc:, *-,- 

UTTMv 7 , position, byte_maskUC) 

UTINY *valueUCP; ^ 

/* set the four bits to this 
/* value 



*/ 
*/ 
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COUNT position; /* i 

/* 1 specifies the upper 4 */ 

/* bits, and 

TBITS byte_»askUC;. /* feature .as)c for template */ 

/* selection 2 specifies the */ 

/* lower 4 bits 
{ / 

/* 

* 

*jetum original mask if error condition found 
*/ 

if (position != 1 4& position != 2) 
return (NO) ; 

/* 

* 

* 

V 

if (position == i) 

byte_maskUC »= 4; 
*valueUCP = (UTINY) (byte.maskUC & Oxf ) ; 

return(YES) ; '"^^ "^^^^ V 

/* 

*/ 

*/ ' /* °f function GET_4_BITS.C*/ 

* SET_4_BITS.C 
* 

^ specified by the input argument "value". 



wo 94/03853 PCT/US93/06883 

42 

* position = 1 — upper four bits (or upper 

* "nybble") 

* position =2 — lower four bits (or upper 

* "nybble") 

5 * . 

* NOTE: function returns a boolean YES for normal operation 

* and NO if an error is detected. 

* ^ 

*/ 

10 # include <stdio.h> 
# include <cic.h> 

BOOL set_4_bits (valueUC, position, byte_inaskUCP) 

UTINY valueUC; /* set the four bits to this */ 

/* value */ 
15 COUNT position; /* specifies the position of */ 

/* the two bits */ 
TBITS *byte_inaskUCP; /* feature TTiask for template */ 

/* selection */ 

{ 

20 /* 

* — — — 

* Return original mask if error condition found 
★ 

*/ 

25 if (valueUC > (UTINY) 15) 

return (NO) ; 
if (position 1= 1 && position i= 2) 
return (NO) ; 

/* 

30 * 

* Set bits as specified by the input arguments 
* 

*/ 

if (position == l) 
35 { 

*byte_maskUCP &= *(OxfO); 

/* zero upper 4 bits inbyte^maskUCP*/ 
*byte_maskUCP j= (valueUC << 4); 
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/* set upper 4 bits to 



new value */ 



else 



{ 



*byte_maskUCP &= "(Oxf); 

/* zero lower 4 bits in byte^maskUCP*/ 
*byte_maskUCP |= valueUC; 





) 

return (YES) ; 


/* set 


upper 4 bits to 


new value 


*/ 


10 










/*■ 












/* 


) 


/* end 


of function SET 


.4_BITS 


*/ 














15 * 
★ 
*- 


This file contains 
constants , macros , 


standard 
and data 


type definitions 
structures. 


f defined 





register 



*/ 

#define FAST 
20 #define GLOBAL 

#define IMPORT extern 
#define THISFL extern 
#define INTERN static 
#define LOCAL 

25 

typedef void 
typedef char 
typedef short 
typedef int 
30 typedef long 



static 
VOID; 

TBOOL, TINY, TEXT; 
COUNT, SHORT; 
INT, ARGINT, METACH; 
LONG; 



typedef unsigned char UTINY, TBITS, UTEXT; 

typedef unsigned short BITS, UCOUNT, USHORT, SIZETYPE; 

typedef unsigned int BYTES, UINT; 

3 5 typedef unsigned long ULONG, LBITS ; 



typedef float FLOAT; 
typedef double DOUBLE; 
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typedef USHORT ERCTYPE ; 

typedef enum 

{ 

5 NO = 0, 

YES = 1 
) BOOL; 

typedef enum 
10 { 

SUCCEED = 0, 

FAIL = 1 

) SUCCESS; 

15 #ifndef abs 

#define abs(x) (((x) < 0) ? -(x) : (x) ) 

#endif 

#ifndef max 

#define max(x,y) (((x) < (y) ) ? (y) : (x) ) 
20 #endif 

rifndef min 

#define min(x,y) (((x) < (y) ) ? (x) : (y) ) 
*fendif 

25 #define FOREVER while (YES) 



#def ine EOS ( *\0' ) 
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WE CLAIM: 

1. A method for electronically compressing 
handwritten data expressible as a sequence of data points, the 

5 method comprising the steps of: 

capturing a parametric representation of data 
handwritten on a digitizer tablet; 

determining local extrema and saving representations 
of said local extrema as first data points; 

determining stroke endpoints and saving ' 
representations of said stroke endpoints as second data points; 
and 

calculating curvature locally about selected data 
points between said second data points and saving only selected 
representations of said selected data points based on 
curvature. 

2. The method according to claim l wherein said 
parametric representation of the digitizer data is a sequence 
of X and y coordinate values in a Cartesian coordinate system 
as a function of time. 



10 



20 



3. The method according to claim i wherein said 
local extrema determining step comprises selecting minimum x^ 
2 5 and y^ and maximum x^ and y^ according to the following 
relationships: 



30 



35 



minimum = 


^i 


< 


^i+1 


and 


^i 


< 


^i-l' 


maximum = 


^i 


> 


^i+l 


and 


^i 


> 


^i-l' 


minimum = 


yi 


< 


yi+1 


and 


Vi 


< 




maximum = 


yi 


> 


yi+i 


and 


yi 


> 


^i-l' 



where 

Xi and y . over the range i = 0 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

4. The method according to claim 1 wherein said 
stroke endpoint determining step comprises selecting a first 
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'*endpoint of a stroke, Xq and Yq, and selecting a last endpoint 
Of a stroke, ^^xsa^p^-i ynsamps-l' 
where 

and over the range i = 0 to i = nsamps-l is the 
5 discrete representation of a stroke, and 
nsamps is the number of points. 

5. The method according to claim 1 wherein said 
curvature determining step comprises calculating an angle 

10 between adjacent data points according to the relationship: 
^i " ^i+d " ^i-d' 

where 

= Cj^ - 360 degrees if > 180 degrees, 
= + 360 degrees if < -180 degrees, 
^i+d = arctan (x^^^ - x^, y^^^ - y . ) , 
^i-d = ^"tan (Xi . x^.^, y. - y . , 
d is an integer no larger than 4, 

x^ and y^ over the range i = d to i = nsamps-l-d is 
the discrete representation of a stroke, and 
nsamps is the number of points. 

6. The method according to claim 5 wherein said 
curvature angle calculating step further comprises discarding 
data points with a curvature angle less than 5 degrees or 

2 5 greater than 25 degrees, 

7. The method according to claim 5 wherein said 
curvature angle calculating step further comprises discarding 
data points with a curvature angle less than 2 5 degrees or 

3 0 greater than 50 degrees, 

8. The method according to claim 1 further 
including a preprocessing step of discarding duplicate data 
points according to the following relationships: 

35 if Xi = x^,^ and y^ = y._^, 

then discard and y^; 
if ^ ^i^i and y^ * y._^, 
then save x^ and y^^; 
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'Where 

Xi and over the range i = 1 to i = nsamps-l is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

9. The method according to claim 1 further 
including a preprocessing step of discarding data points 
according to the following procedure: 

set j = 1, 

calculate d = { (Xp - Xj)2 + (y^ _ y.)2^h^ 

if d < 5, then discard point corresponding to index 

j, set j = j + 1^ and recalculate d; 

if d > 5, then shift remaining points down so that 

the point (Xj, Yj) corresponds to (x^^, y^) , and quit the 

calculation, 

where 

and over the range i = 0 to i = nsamps-l is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

10. The method according to claim 1 further 
including a preprocessing step of discarding data points 
according to the following procedure: 

set j = 2, 

calculate d = ( (x^ .^2. 

^ ^ ^nsamps-l ^nsamps- j J ^ 

^ynsamps-l"ynsamps- j ) ^ ) ' 
if d < 5, then discard point corresponding to index 

3, set j = j + 1^ and recalculate d; 

If d > 5, then shift remaining points down so that 

the point (x^samps-^' ^nsamps-j) corresponds to (Xns3j^pg_2' 

ynsamps-2^ quit. the calculation. 

11. The method according to claim 1 further 
including a preprocessing step of dividing data points between 
said first data points into groups of three adjacent points and 
calculating the median value of said group according to the 
following relationships: 

Cj = median value of (Xj_^, x ^ , Xj^-^J' 



X- 
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" Vj = median value of (Yj^i/ y j , Yj + i). 

where 

and over the range i = 1 to i = nsainps-2 is the 
discrete representation of a stroke, and 
5 nsamps is the number of points. 



12 • The method according to claim 1 further 
including a preprocessing step of dividing data points between 
said first data points into groups of three adjacent data 
points and calculating a representative value for said group 
according to the following relationships: 

Xj = 0.25Xj_3^ 0.5Xj + 0.25Xj+3^, 

Yj = 0.25yj_^ + 0.5yj + 0,25yj^^. 

where 

x^ and y^ over the range i = 1 to i = nsaTnps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

13. An apparatus for electronically compressing 
handwritten data expressible as a sequence of data points, the 
apparatus comprising: 

means for capturing a parametric representation of 
data handwritten on a digitizer tablet; 

means for determining local extrema and saving 
representations of said local extrema as first data points; 

means for determining stroke endpoints and saving 
representations of said stroke endpoints as second data points; 
and 

means for calculating curvature locally about 
selected data points between said second data points and saving 
only selected representations of said selected data points 
based on curvature. 



14, The apparatus according to claim 13 wherein said 
35 parametric represe . ation of the digitizer data is a sequence 
of X and y coordir e values in a Cartesian coordinate system 
as a function of tii-ae. 
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10 



"^^^ apparatus according to claim 13 wherein said 
local extrema determining means comprises means for selecting 
minimum Xi and and means for selecting maximum x. and y. 
according to the following relationships: ^ 
minimum = x^ < x^^^ and x^ < x^.^, 
maximum = x^ > x^^^ and x^ > x^.^, 
minimum = y^ < y.^^ and y. < y._^, 
maximum = y^ > y.^^ and y^ > y. 

where 

Xi and yi over the range i = 0 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

apparatus according to claim 13 wherein said 
15 stroke endpoint determining means comprises means for selecting 
a first endpoint of a stroke, x^ and y^, and means for 
selecting a last endpoint of a stroke, x and v 

where ''nsamps-l ^nd ynsamps-i' 

Xi and yi over the range i = 0 to i = nsamps-1 is the 
20 discrete representation of a stroke, and 
nsamps is the number of points. 

17. The apparatus according to claim 13 wherein said 
curvature determining means comprises means for calculating an 
25 angle between adjacent data points according to the 
relationship: 

" ^i+d ■ ^i-d' 

where 

Ci = Ci - 3 60 degrees if Ci > 180 degrees, 

" ^i degrees if Ci < -I80 degrees, 

ai^d = arctan (Xi^^ - Xi, yi^^ - y . ) , 
ai.d = arctan (Xi - Xi.^, yi - y , 
d IS an integer no larger than 4, 

Xi and y. over the range i = d to i = nsamps-i-d is 
the discrete representation of a stroke, and 
nsamps is the number of points. 



35 
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18- The apparatus according to claim 17 wherein said 
curvature angle calculating means further comprises means for 
discarding data points with a curvature angle less than 5 
degrees or greater than 25 degrees, 

5 

19. The apparatus according to claim 17 wherein said 
curvature angle calculating means further comprises means for 
discarding data points with a curvature angle less than 25 
degrees or greater than 50 degrees. 



15 



20. The apparatus according to claim 13 further 
including a preprocessing means for discarding duplicate data 
points according to the following relationships: 

if = XjL_i and = y^.^, 
then discard x^^ and y^; 
if x^ * Xi_T^ and y^ ^ y^.-^, 
then save and y^^; 

where 

^i and y^ over the range i = 1 to i = nsamps-1 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

21, The apparatus according to claim 13 further 
25 including a preprocessing means for discarding data points 

according to the following procedure: 
set j = 1, 

calculate d = { (Xq - Xj ) 2 + (y^ - ) 2 j ^ 

if d < 5, then discard point corresponding to index 
30 j, set j = j + 1, and recalculate d; 

if d > 5, then shift remaining points down so that 
the point (Xj, y j ) corresponds to (X;^, y^^) , and quit the 
calculation, 
where 

-^^ and y : over the range i = 0 to i = nsamps-1 is the 

discrete representation of a stroke, and 
nsamps is the number of points. 
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"22. The apparatus according to claim 13 further 
including a preprocessing means for discarding data points 
according to the following procedure: 
set j = 2, 

calculate d = { (x«„,„«„ ,-x •)^+ 
* ^^nsamps-i ^nsamps-^^ ^ 

^ynsamps-i'^nsamps- j ) ^ ) ^ * 
if d < 5, then discard point corresponding to index 

j, set j = j + 1, and recalculate d; 

if d > 5, then shift remaining points down so that 

the point (x„3^„p3.j, Ynsanps-j) corresponds to (x^^^^^^.^' 

ynsamps-2) 1"^^ calculation. 

23. The apparatus according to claim 13 further 
including a preprocessing means for dividing data points 
between said first data points into groups of three adjacent 
points and means for calculating the median value of said group 
according to the following relationships: 

Xj = median value of {Xj.^, Xj, Xj^^^), and 
Yj = median value of {yj.^, y j , Yj+i). 

where 

Xi and y^ over the range i = i to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 

24. The apparatus according to claim 13 further 
including a preprocessing means for dividing data points 
between said first data points into groups of three adjacent 
data points and means for calculating . a representative value 
for said group according to the following relationships: 

Xj = 0.25Xj.;^ + o.5Xj + 0.25Xj^.^, and 
yj = 0.25yj_^ + o.5yj + 0.25yj^^. 

where 

x^ and y^ over the range i = 1 to i = nsamps-2 is the 
discrete representation of a stroke, and 
nsamps is the number of points. 
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